Voice activated/voice responsive item locater

ABSTRACT

The present invention is an item location system which relies upon voice activation and responsiveness to identify location(s) of item(s) sought by a user. The system includes a continuous speech recognition digital signal processor, a programmable microprocessor interfaced therewith, voice input and user feedback mechanisms, including audio and/or video feedback. Preferred embodiments utilize audio feedback to the user. The continuous speech recognition engine utilizes Hidden Markov Models to create real time continuous speech recognition and feedback.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to voice activated/voice responsive itemlocators, i.e. item directories which direct a user such as a consumeror shopper, to a specific location to view, treat, retrieve, order,purchase or otherwise use the information obtained in the system.Typically, the present invention could be used at retail stores tolocate items to be purchased. Alternatively, it could be used at aproduction facility or distribution facility having a large number ofparts, to locate specific parts for an employee. In other embodiments,it could be used in non-commercial entities, such as a public library tolocate a particular book. The locator of the present invention reliesupon a specific software module to accomplish voice recognition andresponse, and includes manager programming for customization, updatesand modifications.

2. Information Disclosure Statement

The following prior art patents represent various inventions relating tomachines involving speech recognition for voice-based operation and thusillustrate known voice recognition applications:

U.S. Pat. No. 5,111,501 to Masanobu Shimanuki describes a telephoneterminal device equipped with a transmitter microphone, a receiver, aspeech recognition unit that receives and recognizes speech signals fromthe transmitter microphone and a circuit to reduce the level of signalssend from a telephone network to the receiver when the speechrecognition unit receives speech signals from the transmittermicrophone. Further, this device is preferably equipped with a speechreproduction unit that reproduces the speech information stored in amemory, in response to the information of recognition result from thespeech recognition unit, and a circuit that prevents transmission ofsignals from the telephone network to the receiver when the regeneratedspeech information in sent to the receiver. Furthermore, it is desirablefor this device to be provided with a circuit that prevents generationof ringing tones when an incoming call arrives.

U.S. Pat. No. 5,136,634 to David C. Rae et al. describes voice operatedfacsimile machine network which includes a method and apparatus fortransmitting specifically requested graphic and/or textual data from anunattended database storage location to a requestor's facsimile machineover a telephone line which includes a host computer such as a PCmodified with a facsimile transmission board and a voice generationboard. The host computer receives incoming phone calls and prompts thecaller using the voice board to select data files by using the DTMF keysof a standard telephone handset. The PC can be left unattended and canrun automatically in the facsimile transmission mode. Callers canimmediately access needed textual and image data with the use of just astandard telephone and facsimile machine. Multiple workstation nodes canbe configured in a network setup to handle a high volume of calls inreal time and to allow multiple data services to operate simultaneously.

U.S. Pat. No. 5,165,095 to Mark A. Borcherding describes a method fordialing a telephone, using voice recognition to initiate the dialing andto determine the correct telephone number. The dialing is initiated witha spoken dial command that is recognized by using speaker independenttemplates that are stored locally with respect to the caller'stelephone. The correct telephone number is recognized by using speakerdependent template that are downloaded from a central database or byusing speaker independent templates stored locally.

U.S. Pat. No. 5,168,548 to Steven Kaufman et al. describes a reportingsystem which is disclosed herein, a speech recognizer which is used toselect selections of text from a report form stored in a computer and toinsert recognized terms in the text hereby to generate a report textunder voice control. A command interpreter, also responsive to spokenwords, initiates creation of the report text and its subsequent storing,printing and transmission. The command processor is responsive torespective spoken commands to select a destination telephone number andto cause the report text to be sent to apparatus for converting reporttext to image data and for modulating an audio band signal with theimage data for facsimile transmission over telephone lines.

U.S. Pat. No. 5,222,121 to Keiko Shimada describes a voice recognitiondialing unit of a telephone mounted on a vehicle or similar mobile bodyand which allows a call to be originated with ease. When the user of thetelephone enters a voice command on voice inputting section, the dialingunit originates a call automatically and thereby connects the otherparty to the telephone line. In a call origination procedure, theoperations for call origination and the verifications are performedbetween the user and the unit in an interactive sequence. In a preferredembodiment, the unit has a particular call origination procedure inwhich, when the other party recognized by the unit is wrong asdetermined by the user by verification, lower place candidates for theother party are called up in response to a particular voice command. Inan alternative embodiment, the unit indicates the other party by voicinga name for verification purpose. The alternative embodiment selects andstores only the name of the other party in response to an entered voicesignal and, in the event of response for verification, combines the namehaving been stored and response information stored beforehand to producecomposite response voice.

U.S. Pat. No. 5,231,670 to Richard S. Goldbor et al. describes a systemand method for generating text from a voice input that divides theprocessing of each speech event into a dictation event and a text event.Each dictation event handles the processing of data relating to theinput into the system, and each text event deals with the generation oftext from the inputted voice signals. In order to easily distinguish thedictation events from each other and text events from each other thesystem and method creates a data structure for storing certaininformation relating to each individual event. Such data structuresenable the system and method to process both simple spoken words as wellas spoken commands and to provide the necessary text generation inresponse to the spoken words or to execute an appropriate function inresponse to a command. Speech recognition includes the ability todistinguish between dictation text and commands.

U.S. Pat. No. 5,239,586 to Kuniyoshi Marui describes a voice recognitionsystem which comprises a handset and a hands-free microphone forgenerating an input audio signal, a high-pass filter for eliminating lowfrequency components from the signal from the handset or hands-freemicrophone, a signal lever controller for adjusting the level of thehigh-pass signal in response to the user of either the handset orhands-free microphone, a storer for storing the speech data and acontroller for controlling the store so that a user's utterance isstored or the user's utterance is recognized by comparing the utteranceto speech data already stored. The handset hook switch provides anon-hook control signal to reduce amplifier gain during hands-freemicrophone operation.

U.S. Pat. No. 5,301,227 to Shoichi Kamei et al. describes an automaticdial telephone that is useable in a motor vehicle, when a voice input isprovided during a period in which input of the names of called partiesis awaited, a voice pattern of the name of the called party is comparedwith reference patterns of called parties stored in reference patternsstoring device, to determine the degree of the similarity therebetween.The names of the called parties are output to a user in the order ofdecreasing degree of similarity. Each time the name of a called party isoutput, a command word for confirmation is a waited from a user for apredetermined time period. When a voice confirmation command is inputand is recognized during this waiting period, a telephone numbercorresponding to the name of the called party is supplied to a channel.Consequently, the command word for confirmation may be input only if thename of the called party outputted is one desired by the user. Sensorscontinually monitor the driving condition of the motor vehicle in whichthe telephone is installed. When the operation of the steering wheel orbrakes of the motor vehicle exceeds a predetermined threshold or thespeed of the motor vehicle is excessive, the sensors generate safetysignals that inhibit the operation of the telephone.

U.S. Pat. No. 5,333,276 to E. Earle Thompson et al. describes acommunication system which is provided with multiple purpose personalcommunication devices. Each communication device includes atouch-sensitive visual display to communicate text and graphicinformation to and from the user and for operating the communicationdevice. Voice activation and voice control capabilities are includedwithin communication devices to perform the same functions as thetouch-sensitive visual display. The communication device includes abuilt-in modem, audio input and output, telephone jacks and wirelesscommunication. A plurality of application modules are used with personalcommunication devices to perform a wide variety of communicationfunction such as information retrievable, on-line data base services,electronic and voice mail. Communication devices and application modulescooperate to allow integrating multiple functions such as real timecommunication, information storage and processing, specializedinformation services, and remote control of other equipment into anintuitively user friendly apparatus. The system includes both desktopand hand-held communication devices with the same full range ofcommunication capabilities provided in each type of communicationdevice.

U.S. Pat. No. 5,349,636 to Roberto Irribarren describes a communicationsystem for verbal telephonic communication which has a voice messagesystem for storing and retrieving voice messages integrated with acomputer database accessing system for storing and retrieving textmessages from a separate computer system and for converting the textmessages into voice. The systems are integrated via a network whichcoordinates the functions of each individual system. Additionally, theinput/output ports of the voice message system and the computer databaseaccessing system are connected to a parallel fashion to at least onetelephone line. In this configuration a user may access both voicemessages and database information, including text or electronic mailmessages, with a single telephone call. Optionally, facsimile messagescan be stored, retrieved and manipulated with a single telephone call.

U.S. Pat. No. 5,406,618 to Stephen B. Knuth et al. describes a telephoneanswering device that is activated by a proximity sensor when a usercrosses its field of detection and whose operation is controlled bysimple voice commands. The device incorporates speaker-independent voicerecognition circuitry to respond to spoken commands of the user that areelicited by a system generated voice request menu. The telephoneanswering device performs all the basic functions of a telephoneanswering machine in response to these simple commands and there is noneed for the user to manually operate the telephone answering device.

U.S. Pat. No. 5,602,963 to W. Michael Bissonnette et al. describes asmall, portable, hand-held electronic personal organizer which performsvoice recognition on words spoken by user to input data into theorganizer and records voice messages from the user. The spoken words andthe voice messages are input via a microphone. The voice messages arecompressed before being converted into digital signals for storage. Thestored digital voice messages are reconverted into analog signals andthen expanded for reproduction using a speaker. The organizer is capableof a number of different functions, including voice training, memorecord, reminder, manual reminder, timer setting, message review,waiting message, calendar, phone group select, number retrieval, addphone number, security and “no” logic. During such various functions,data is principally entered by voice and occasionally through use of alimited keypad, and voice recordings are made and played back asappropriate. A visual display provides feedback to the user. during thevarious function, the user can edit various different data within theorganizer by eliminating or correcting such data or entering new data.

U.S. Pat. No. 5,621,658 to Brion K. Jackson describes an actioncontained within an electronic mail object which is communicated from adata processing system to another data processing system via an audiodevice. The action is executable on a data processing system. At thesending data processing system, the action is converted to apredetermined audio pattern. The electronic mail object may contain textin addition to an action. The text is also converted to an audiopattern. The audio pattern are then communicated to the audio deviceover telephone lines or other communication medium. At the receivingend, the audio device records the object. A user can provide therecorded object to a data processing system, when then executes theaction and converts the text audio patterns back to text. In addition,the action can be converted to text and displayed on the data processingsystem.

U.S. Pat. No. 5,631,745 to John J. Wong et al. describes a telephoneterminal adapted for business or home use that includes the ability toreceive and send facsimiles, a voice answering function and a computermodem. Various input and output devices may be used for the facsimilefunction. A voice annotated facsimile may be sent and received. At thesame time the facsimile is viewed on a video monitor or ordinarytelevision set, an accompanying voice message is heard through thesecond system of the monitor or television set. The terminal has anarchitecture including a central processor and an internal bus structureto which several types of memory, various input-output devices and aninterface with the telephone line are connected, among others. AudioRandom Access Memory (ARAM) is used for storing both facsimile data andvoice data.

U.S. Pat. No. 5,671,328 to Gregory P. Fitzpatrick et al. describes amethod and data processing system which are disclosed for automaticallycreating voice processing template entities. In one embodiment, theinvention automatically assembles a plurality of commands received bythe data processing system, at least one of said commands having a voicerecognition criteria component associated therewith, counts theoccurrences of the plurality of commands, assembles voice recognitioncriteria components associated with the plurality of commands, and, as aresult of the occurrence count exceeding a predefined minimum,constructs a voice recognition template entry by associating theassembled voice recognition criteria components with the assembledplurality of commands.

U.S. Pat. No. 5,850,627 to Joel M. Gould et al. describes a wordrecognition system which can: respond to the input of a character stringfrom a user by limiting the words it will recognize to words having arelated, but not necessarily the same, string: score signals generatedafter a user has been prompted to generate a given word against wordsother than the prompted word to determine if the signal should be usedto train the prompted words; vary the number of signals a user isprompted to generate to train a given word as a function of how well thetraining signals score against each other or prior models for theprompted word; create a new acoustic model of a phrase by concatenatingprior acoustic models of the words in the phrase; obtain informationfrom another program running on the same computer, such as its commandsor the context of text being entered into it, and use that informationto vary which words it can recognize; determine which program unit, suchas an application program or dialog box, currently has input focus onits computer and create a vocabulary state associated with that programunit into which vocabulary words which will be made active when thatprogram group has the focus can be put; detect the availablecomputational resources and alter the instructions it executes inresponse; test if its ability to respond to voice input has been shutoff without user confirmation, and, if so, turn that ability back on andprompt the user to confirm if that ability is to be turned off; storeboth a first and a second set of models for individual vocabulary wordsand enable a user to selectively cause the recognizer to disregard thesecond set of models for a selected word; and/or score a signalrepresenting a given word against models for that word from differentword model sets to select which model should be used for futurerecognition.

Nothwithstanding the prior art, the present invention is neither taughtnor rendered obvious thereby.

SUMMARY OF THE INVENTION

A voice activated/voice responsive item locator system is disclosed toenable a user to speak into the system and have the system respond withlocation information for an item requested by the user. For example,shopper at a home supply store may pick up a locator phone or just speakinto a wall mounted or otherwise situated microphone and say “LocateOutdoor Paint” or “Find Hammers” of simply state what is sought withoutthe use of a verb, e.g. “Caulking”. The system may reply either withvoice or visual (words on a screen, or map), or both voice and visual,e.g. “Aisle 3, Shelf 4”. In some instances the system will reply, forexample, with a “Repeat”, or “Restate in different words” or “Pleasetalk to information desk” or other default instructions.

The locator system may be a stand alone device, but in most embodimentswould be part of an internal connected system. It could be an intranetor secured internet system, but would in many cases be a storewidesystem with a plurality of user locations (units, phones, ormicrophones, with feedback at each location). The system will include anembedded voice-driven interface for speech control of: (1) operationalinstructions; (2) core system locator function operations, that is,recognition of specific requests and responses thereto; and, (3)optional and default functions. In preferred embodiments, the presentinvention device is both operated by speech (speech or voice activated)and speech responsive (voice answers and instructions to the user fromthe system). Thus, the present invention device relies upon automaticspeech recognition (ASR), either in place of or in addition to manuallocator systems, e.g. books, list, map and computer directories. In someembodiments, user feedback features are included wherein both audio andvisual feedback is given to a user in response to recognizable voicesignals, while in other possible embodiments, the user may designateaudio or visual.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention should be more fully understood when thespecification herein is taken in conjunction with the drawings appendedhereto wherein:

FIG. 1 shows a general schematic diagram showing software and functionalfeatures of a present invention item locator system;

FIG. 2 shows a schematic diagram illustrating the physical functions ofa present invention voice recognition item locator device; and,

FIG. 3 shows a schematic diagram of a present invention deviceillustrating details of a voice recognition submodule.

DETAILED DESCRIPTION OF THE PRESENT INVENTION

The present invention is a voice activated/voice responsive item locatorand system. By “item” is meant a place or thing that a user desires tolocate. Thus, a item could be a particular brand of canned string beans,a type of outdoor stain, a booth at a convention, a particular part ininventory for sale, assemblage or distribution, a particular automobilein a production facility lot or in a large parking garage, or a room, afunctional group or a person in an office building or the like. Theresponse may be in the form of a word or sentence presented visually oraudibly and it may designate an aisle, a shelf, a bin number, a roomnumber, a row and slot or space, etc.

The voice recognition system digitizes words spoken via a receiver(microphone) handset, headset, or built-in microphone for conversionfrom analog to digital utilizing a continuous speech recognition digitalsignal processor (DSP). The main support structure may be a conventionaltype housing for phones and other communication devices, may be of adifferent shape or configuration or may be built into a device such as awall or desk unit, with or without monitor. They could be portable orpermanently affixed and could be powered by any means available, e.g. ACor DC current. In the portable mode, the system would be wireless forthe user and would, in that respect operate like a cell phone, two wayradio, “walkie talkie” or other short distance wireless device, butwould have a processor at a central or fixed location having the samefeatures as described above, i.e., the DSP with programmingcapabilities, etc.

The DSP is connected to a programmable microprocessor and either bycustomized input or a standard program, the system enables the user toquickly enter voice-activated fields, e.g,, such as “Where is . . . ”,“Find . . . ”, etc. Verification of voice recognition accuracy (prior toexecution) is optional and may be accomplished via synthesized voiceplayback and/or a screen confirmation which requires a “YES” or “NO” toexecute or open for revision. In some preferred embodiments, a screen,e.g., LCD, enables visual feedback during input phase, with support fordetection, insertion, correction, etc. Cancellation of the entirecommand or programming instructions may be possible at any time (priorto execution), via keystroke or voice command.

The essential features of the present invention involve the creation ofa voice based guide or locator to offer enhanced convenience and speedto users for location of one or more items.

FIG. 1 shows a general schematic diagram of a present invention systemshowing general software features and functional features. Thus, thepresent invention device includes a central processor 1 which may be anexternal or internal component, i.e., within a single unit or at aseparate location from audio receivers and transmitters, e.g.,microphones/speakers for user inputs and feedback to users.

The system may be preprogrammed with the user being required to followconcise instructions for activation and operation, or may beprogrammable to alter, add or enhance ease or methods of use, e.g.through a limited access code, for manager inputs 3 of userinstructions. In any event, manager inputs 3 shall include functionalselections and inputs of items and their locations, with provision forsubsequent access for modification. This programming may include directkeyboard, voice, etc., and, as mentioned, may include securitycapabilities for preventing unauthorized use, e.g., voice identification(user recognition) or user security code system, as well as otheroptions which may be included therein, such as a “help” detailed managerinstruction section.

Once the system has been programmed for use, the user operation unit(s)5 provide functional access, which may be passive, i.e., the userspeaks, picks up a phone, presses a button, or otherwise takes someaction to activate the system; or it may be active, i.e., a proximitysensor, a periodicity timer, or other internal mechanism mayautomatically activate the system and could trigger an audio or visualquery, such as “May I help you locate a product?”

Once the system has been activated and a user has stated the necessarywords of input to activate the device, recognition/non-recognitionresponse 7 results from processing the user inputs to central processor1, and audio and/or video response unit(s) 9 provide feedback 11 to theuser, either by answering the inquiry, conditionally defaulting, e.g.,asking for a repeat or a restate the question, or fully defaulting e.g.directing the user to a courtesy desk or check out counter for help.

FIG. 2 shows a schematic diagram illustrating a present invention voiceactivated/voice responsive item locator system, showing the physicalarrangement and function of components. Thus, symbol 17 indicates anoptional user prompter proximity sensor and symbol 21 is a microphone orequivalent component for voice input. The voice input to sent to audiocontroller 19 an to automatic speech recognition unit 23 and isconverted from analog to digital signals. The speech recognition 23communicates with a continuous speech signal recognizer 41 and acontinuous speech signal interpreter 43. CPU/Memory 25 compares thedigital signal to the set up or dictionary of digital words or phrasesin memory. Once a match is found, the system processor 27 and datastorage 31 operate to respond with an answer or a default instruction ora query by providing digital text to text-to-speech generator 29, whichprovides audio feedback to a user via audio controller 19 and speaker 33Feedback to a user may also be provided on visual screen 37 via displaycontroller 35. Keypad 39 is used for manager set up and modifications.

FIG. 3 shows the details of one preferred embodiment of the submoduleused in the present invention device. The voice recognition componentconverts an acoustic signal into a subsequence of labels. The systemtakes the raw acoustic data, and processes it through the recognizer.The recognizer then matches it against a set of models using a decoderthat generates a recognition token. This token represents what the usersaid as either a single word or utterance. The recognizer itself doesnot interpret the meaning of the recognized output, that is the functionof the interpreter (described later). The recognizer uses Hidden MarkovModels (HMMS) to provide for a continuous speech recognition engine.HMMs do not process the acoustic signal directly but instead split thesignal into a sequence of discrete observations. These observations arederived from a digital representation of the signal that had beenconverted from the analog signal generated by the microphone. Duringrecognition, the likelihood of each model (or sequence of models)matching the incoming signal is calculated. The recognizer simplyselects the most likely model to decode the signal. As this is donecontinuously, the recognizer can process speech as opposed to isolatedwords, allowing the user to talk more naturally.

Each acoustic model represents a short sound. The interpreter combinesthese sounds into words using a dictionary. This dictionary specifiesthe pronunciation of each word in terms of the acoustic models. Afteridentifying the most likely word, the interpreter then joins sets ofmodels together (using a Viterbi decoder) in a series of pre-definedconnections such that paths can be established to provide for a degreeof “natural language” recognition; in other words, the user can say“Find hammers”, “Where are hammers” or “hammers” and they are allunderstood to mean the same thing. Moreover, these sets of models anddictionaries are interchangeable, allowing the same voice recognitioncomponent to be used in a variety of applications.

As the voice recognition component is running continuously, there needsto be a way onto distinguish background conversations that mightaccidentally trigger an unwanted action by the device. For example, twopeople standing by a voice-activated device might be discussinglocations of different goods in a supermarket and be misinterpreted orundesireably resonded to. To avoid this problem, the recognition unitrequires a command word to trigger before beginning further recognition.The trigger word is a user-definable setting.

Thus, in FIG. 3, initialization 51 initiates monitoring 53 for a triggerword from a user. When a word is received, it is analyzed to determinewhether or not a trigger word 55 has been received. If not, signal 57returns the status to monitoring 53 for a new word. This loop continuesuntil a trigger word is recognized and an inactivity timer 59 isstarted. The monitor 61 proceeds with the monitoring for the next wordand waits for timer pop 65. When an event 63 is received, timer pop 65returns to the monitor 53 to continue the monitoring process and thevoice data is sent to interpretation 67. If it is understood 69, anaction 75 if process and feedback function 77 is performed.Additionally, signal 79 prompts user 71. Likewise, if the interpretationis not understood 69, user 71 is prompted and via signal 73, timer 59begins again. These cyclings operate on a continual basis while thesystem is initiated. Voice activation may also be used to shut down thesystem.

Obviously, numerous modifications and variations of the presentinvention are possible in light of the above teachings. It is thereforeunderstood that within the scope of the appended claims, the inventionmay be practiced otherwise than as specifically described herein.

1. An item locator system having both voice activation and voiceresponsive capabilities for location feedback to locate one or morespecific goods in a retail store facility, which comprises said systemcomprising: a.) a support structure, for physically supporting saidsystem at one or more locations, and functionally containing orconnected to the following components: b.) a continuous speechrecognition digital signal processor (DSP), wherein said continuousspeech recognition engineDSP utilizes tokens of raw acoustic signalsrepresenting utterances or words and matches these against a set ofmodels and then relies upon likelihood to select a most likely model todecode signals for interpretation; c.) a programmable microprocessorinterfaced with, coupled to said speech recognition DSP; , d.)sufficient programming and circuitry contained within said programmablemicroprocessor to provide for voice activation and voice recognitionandaudio and/or visual response to provide item location to a userwherein item and location data are defined by manager input to saidsystem ; e.) voice input means connectedcoupled to said speechrecognition DSP; f.)at least one memory storage means connected to,coupled to said programmable microprocessor and/or said DSP, for storageof operational inputs, control inputs, and a voice recognitionvocabulary for storage of command match and execute functions; and g.)at least one user feedback unit and connection from, coupled to saidprogrammable microprocessor to said at least one user feedback unit,said at least one user feedback unit ,adapted to provide audio and/orvisual feedback selected from the group consisting of audio feedback,visual feedback and combinations thereof, to asaid user in response toan item location query.
 2. The system of claim 1 wherein said userfeedback unit includes visual display means for viewing visual feedbackin the form of text, or a map, or a combination thereof.
 3. The systemof claim 1 wherein said user feedback unit includes sufficient hardwareand software to provide audio feedback to a said user in response torecognizable voice input.
 4. The system of claim 1 wherein said at leastone memory storage means further includes flash ROM storage and providesfor remote diagnostics and system programming.
 5. The system of claim 1wherein said voice input means includes a microphone.
 6. The system ofclaim 1 which further includes including a secured manual control panelfor input and management of item and location data into said system. 7.The system of claim 6 wherein said manual control panel furthercontains: a keypad and menu for operation and programming options, amicrophone, and a screen for input and feedback display.
 8. The systemof claim 1 which additional components further includes wherein saiduser feedback unit comprises: an audio feedback component which includesincluding audio feedback hardware and a software adapter to audiblyrespond to recognizable voice input, including wherein said audiofeedback component uses digital-to-analog conversion; and an outputspeaker to provide audio feedback to said user.
 9. The system of claim 1wherein said DSP includes a continuous speech recognition engine havinga continuous speech signal recognizer and a continuous speech signalinterpreter.
 10. The system of claim 1 wherein said programing andcircuitry within said programmable microprocessor includes an embedded,voice-driven interface for control of operational instructions, systemlocator function operations, and option and default functions.
 11. Thesystem of claim 1 wherein said response to provide item location to auser includes aisle location.
 12. The system of claim 1 wherein saidresponse to provide item location to a user includes shelf location. 13.The system of claim 1 wherein said response to provide item location toa user includes aisle and shelf location.
 14. The system of claim 1wherein said response to provide item location to a user includes binnumber.
 15. The system of claim 1 wherein said response to provide itemlocation to a user includes row and slot location.
 16. The system ofclaim 5 wherein said microphone is selected from the group consistingof: a receiver handset, a headset, and a built-in microphone.
 17. Thesystem of claim 1 wherein said support structure is a portable supportstructure further comprising a support structure, for physicallysupporting at least a portion of said item locator system, andfunctionally containing or connected to said voice input means and saidat least one user feedback unit.
 18. The system of claim 1 wherein saidspeech recognition engine uses DSP comprises a continuous speechrecognition engine operable to use Hidden Markov Models for itscontinuous speech recognition engine .
 19. The system of claim 6 whereinsaid speech recognition engine uses Hidden Markov Models for itscontinuous speech recognition engine DSP comprises a continuous speechrecognition engine operable to use Hidden Markov Models.
 20. The systemof claim 8 wherein said speech recognition engine uses Hidden MarkovModels for its continuous speech recognition engine DSP comprises acontinuous speech recognition engine operable to use Hidden MarkovModels.
 21. An item locator system for location feedback to locate oneor more specific items in a facility, comprising: at least one processoroperable to receive acoustic signals representing utterances or words,match the acoustic signals against a set of models, and select one ofthe set of models to decode the acoustic signals, wherein the at leastone processor is further operable to detect a trigger word for continuedinterpretation of the utterances or words; at least one memory coupledto at least one processor for storage of operational inputs and a voicerecognition vocabulary of command match and execute functions; and atleast one user feedback circuit coupled to the at least one processor,wherein the at least one user feedback circuit is operable to providevisual or audio feedback to a user in response to an item locationquery.
 22. A locator system comprising: a local processor residing in afacility; a database, coupled for access by the local processor, storinglocation information for each of a plurality of items in the facility; aremote processor, coupled to communicate with the local processor,operable to update at least a portion of the database; a voice-driveninterface, coupled to the local processor, for accepting acousticsignals corresponding to a request by a user for the location of atleast one of the plurality of items; and a controller, coupled to thelocal processor, to provide audio or visual signals to the userregarding the location of the at least one of the plurality of items.23. The locator system of claim 22 wherein the database resides remotelyfrom the local processor.
 24. The locator system of claim 22 wherein:the facility is a retail store; and the item is a consumer item.
 25. Alocator system comprising: at least one processor for processingacoustic signals corresponding to a request by a user for the locationof at least one of a plurality of items in a facility; a database,coupled to the at least one processor, storing location information foreach of the plurality of items; and at least one memory, coupled to theat least one processor, for storage of an originally active vocabularyand a voice recognition vocabulary of command match and executefunctions, wherein the at least one processor is configured to accessthe at least one memory when processing the acoustic signals, and the atleast one processor is configured to access the database to provideinformation to the user regarding the location in the facility of the atleast one of the plurality of items.
 26. The locator system of claim 25wherein the at least one memory is programmable by a manager input. 27.The locator system of claim 25 wherein processing the acoustic signalscomprises using Markov models.
 28. A locator system comprising: aplurality of user locations, each comprising a local processor, locatedin a facility; a database, coupled for access by the local processor ofeach of the plurality of user locations, storing location informationfor each of a plurality of items in the facility; and wherein each ofthe plurality of user locations further comprises: a voice-driveninterface for accepting acoustic utterances or words from a user torequest the location of at least one of the plurality of items; at leastone memory with a voice recognition vocabulary for storage of commandmatch and execute functions; and a controller to provide audio or visualsignals to the user regarding the location of the at least one of theplurality of items.
 29. The locator system of claim 28 furthercomprising a remote processor, coupled to communicate with each of theplurality of user locations, operable to update at least a portion ofthe database through manager inputs provided to the remote processor.30. The locator system of claim 28 further comprising a map to providethe location of the at least one of the plurality of items to the user.31. The locator system of claim 28 wherein the at least one memoryfurther stores an originally active vocabulary.